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Abstract — Spectrum management has been identified as a 
crucial step towards enabling the technology of the cognitive 
radio network (CRN). Most of the current works dealing with 
spectrum management in the CRN focus on a single task of 
the problem, e.g., spectrum sensing, spectrum decision, spectrum 
sharing or spectrum mobility. In this work, we argue that for 
certain network configurations, jointly performing several tasks 
of the spectrum management improves the spectrum efficiency. 
Specifically, we study the uplink resource management problem 
in a CRN where there exist multiple cognitive users (CUs) and 
access points (APs), with each AP operates on a set of non- 
overlapping channels. The CUs, in order to maximize their uplink 
transmission rates, have to associate to a suitable AP (spectrum 
decision), and to share the channels belong to this AP with other 
CUs (spectrum sharing). These tasks are clearly interdependent, 
and the problem of how they should be carried out efficiently 
and distributedly is still open in the literature. 

In this work we formulate this joint spectrum decision and 
spectrum sharing problem into a non-cooperative game, in which 
the feasible strategy of a player contains a discrete variable and 
a continuous vector. The structure of the game is hence very 
different from most non-cooperative spectrum management game 
proposed in the literature. We provide characterization of the 
Nash Equilibrium (NE) of this game, and present a set of novel 
algorithms that allow the CUs to distributively and efficiently 
select the suitable AP and share the channels with other CUs. 
Finally, we study the properties of the proposed algorithms as 
well as their performance via extensive simulations. 

I. Introduction 

A. Motivation and Related Work 

The problem of distributed spectrum management in the 
context of CRN has been under intensive research recently. 
As pointed out by the authors of O], the spectrum manage- 
ment needs to address four main tasks: 1) spectrum sensing, 
techniques that ensure CUs to find the unused spectrum for 
communication; 2) spectrum decision, protocols that enable 
the CUs to decide on the best set of channels; 3) spectrum 
sharing, schemes that allow different CUs to share the same set 
of channels; 4) spectrum mobility, rules that require the CUs to 
leave the channel if licensed users are detected. Many efforts 
have been devoted to providing solutions to the individual 
tasks listed above. However, as we will see in this paper, 
in some CRN scenarios, several of the above tasks become 
interdependent, and the CUs have to perform these tasks jointly 
to achieve best performance. In this work, we propose to 
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provide solutions for the joint spectrum decision and spectrum 
sharing problems in a multi-channel multi-user CRN. 

We focus on investigating an important CRN scenario where 
such joint spectrum decision and spectrum sharing is desirable. 
Consider a network with multiple CUs and APs, where the 
APs operate on different sets of channels, and the CUs need 
to connect to one of the APs for communication. The CUs can 
use multiple channels belong to the associated AP concurrently 
for transmission, but different CUs interfere with each other if 
they use the same channel. This network is a generalization of 
the single AP network considered in many previous literature, 
e.g., IJl and [S). In the considered network, the CUs face the 
spectrum decision problem when they select the AP, and they 
face the spectrum sharing problem when they try to dynami- 
cally allocate their communication power across the channels 
belong to the selected AP. Clearly, these two problems are 
strongly interdependent, as on the one hand a particular CU 
has to select an AP before it can share the spectrum that 
belongs to this AP with all the other CUs associated with 
it; on the other hand, after sharing the spectrum, an individual 
CU may have the incentive to switch to a different AP if 
it perceives that such action will increase its communication 
rate. A poor spectrum decision and spectrum sharing scheme 
will not only lead to unsatisfactory performance for individual 
CUs, but also result in an unstable system in which CUs are 
constantly unsatisfied with their current communication rates 
and consequently changing their AP associations and power 
allocation indefinitely. 

A similar problem related to the joint cell selection/base 
station (BS) association and power control has been addressed 
in infrastructure-based cellular networks. HI and [2] are early 
works trying to tackle this problem in an uplink spread 
spectrum cellular network. The objectives are to let the users 
find a best site selection and power allocation tuple such that 
all users' target signal to interference ratio (SIR) are met, and 
all users' transmission power is minimized. The authors of [l6l 
and Ul cast a similar problem (with an objective to maximize 
individual power efficiency or minimizing individual cost) into 
game theoretical frameworks, and propose algorithms to find 
the Nash Equilibrium (NE) of the proposed games. One of the 
most important differences between our work and the above 
cited works is that the power allocation problems in these 
works are essentially scalar value optimization problem; each 
user only needs to decide on its power level once a BS is 
selected, while in our work, individual power allocation is a 



vector optimization problem as the CUs have the flexibiUty to 
use all the channels belong to a particular AP concurrently. 
This fundamental difference makes the considered problem 
more complex, hence the analytical frameworks provided by 
the above cited works are not suitable for our problem. It 
can also be argued that the problem under consideration is 
also in many aspects more complicated than the traditional 
AP association problems arise in the 802.11 WLAN network 
(for example, JS), ||9l and ifTOl and the reference therein). 
Typically, AP association is aiming to optimize different 
system performance metrics (throughput, fairness, etc), and 
only simple individual throughput estimates within each AP 
are used to update the current association profile. Indeed, in 
802.11 WLAN network, the throughput of an individual AP 
with fixed number of users and fixed physical bit rate can be 
approximated using simple analytical formulae ifTTI . and this 
result has greatly simplified the analysis of many work dealing 
with dynamic AP association in WLAN, e.g., ilOl and i 12] . 

We also note here that the problem of how to dynamically 
perform the task of both spectrum decision and spectrum 
sharing may arise in other important CRN configurations 
as well. Most of the current works addressing the spectrum 
management problem in multi-channel multi-user CRN focus 
only on the spectrum sharing part of the problem. For example, 
in |[T3l . |[T4l . a set of iterative water-filling (IWF) based 
algorithms are proposed to find a distributed solution of power 
allocation in multi-channel, multi-user CRN with interference 
channels. One important assumption underlying these works 
is that the CUs are able to use all the channels simultaneously. 
However, this assumption might not be valid in the situation 
where the available spectrum is fragmented due to licensed 
user activities and where the CUs are equipped with 1 -agile 
radio which can only use a single set of continuously aligned 
channels at a time B In this scenario, the CUs need to select 
the set of channels to use, and decide on the allocation of 
the transmission power to the selected set of channels, i.e., 
the CUs are required to perform the task of joint spectrum 
decision and spectrum sharing. Although the problem of how 
to optimally perform such task has never been addressed in 
literature before, it is our belief that our work can also serve to 
shed some lights on providing solutions to it, as the network 
configuration considered in our work is sufficiently similar to 
the configuration mentioned above. 

B. Contributions and Organization of This Work 

To the best of our knowledge, this is the first work that 
propose distributed algorithms to deal with joint AP selection 
and power allocation problem in a multi-channel multi-AP 
CRN. We cast the problem into a non-cooperative game 
framework, in which each CU's objective is to maximize its 
own transmission rate, and its strategy space is the union of a 
discrete set (the set of possible APs) and a multi-dimensional 

'IWF is originally proposed in 1151 for DSL network, and subsequently 
applied to wireless networks. See e.g., 1161 and the references therein. 

^see 1171 for detailed discussion for the possibility that this scenario might 
lise in actual CRN implementations 



continuous set (the set of feasible power vectors). Although 
non-cooperative game theory has recently been extensively 
applied to solve the resource allocation problem in CRN (e.g., 
lfT3l and llT4l and the reference therein), our formulation is 
considerably different and more involved because of such 
"hybrid" nature of the strategy space of the game. We analyze 
in detail the equilibrium solution of the game, and develop an 
algorithms with provable convergence guarantees that enables 
the CUs to distributedly compute the equilibrium solution. Fi- 
nally, we suggest various extensions of our original algorithm 
based on practical considerations. 

We organize the paper as follows. In section we present 
the system model and formulate the problem into a non- 
cooperative game. In section HUl we analyze the properties of 
the NE. In section IIVI we provide our main algorithm and its 
convergence properties. In section |V] we provide extensions of 
the JASPA algorithm. We present simulation results in section 
and conclude the paper in section IVIII 

II. Problem Statement and System Model 
A. Considered Network and Some Assumptions 

We consider the following cognitive network configuration. 
Suppose there are a set {1, • • • ,N} =M CUs, {1, • ■ • , K} = 
K. channels and {1, • • • , = W APs in the network. Each 
AP w E W is assigned with a subset of channels /C^ C /C. We 
focus on the uplink scenario where each CU wants to connect 
to one of the APs for transmission. 

The followings are our main assumptions of the network. 
A-1) Each CU i is able to associate to all the APs, and each 
AP covers entire area of the network. 

A-2) The APs covering the same area operate on non- 
overlapping portions of the available spectrum. 
A-3) The set of available spectrum can be used exclusively by 
the CRN, for a relative long period of time. 
A-4) Each CU can associate to a single AP at a time; it can 
concurrently use all the channels of the associated AP. 

Assumption A-1) is made merely for ease of presentation, 
and our work can be extended to the scenarios where different 
APs cover different areas of the network, and where the CUs 
can only connect to the subset of APs that cover them. 

Assumption A-2) is commonly used when considering AP 
association problems in WLAN (for example, in lfT2l '). and it 
is made to mitigate interference between neighboring APs. It 
can be achieved either by 1) the APs agree offline the partition 
of the spectrum, or 2) the APs jointly run a distributed online 
spectrum assignment algorithm similar to the ones proposed 
in Is) to determine the best spectrum assignment. Assumption 
A-1) and A-2) imply that ICqf]ICu, = 9, y q^w,q,w e W. 

Assumption A-3) can be achieved either under the spectrum 
property right model in which the licensed networks sell or 
lease the spectrum to the cognitive network for a period of 
time for exclusive use, or under the situation that the cognitive 
network exploits relative static spectrum white spaces unused 
by local TV broadcast HS). 



B. System Model and A Non-Cooperative Game Formulation 

Let {\hi^m{k)\'^} keic^,. be the set of power gains from CU 
i to AP w on all its channels; Let {«u,(fc)}feeK:™ be the set 
environmental noise powers on all channels for AP w; Let the 

X 1 vector a denote the association profile in the network, 
with its i"* element a(i) ~ w indicating that CU i is associated 
to AP w. Each CU i is able to obtain its own channel gains 
to all the APs, {\K,n{k)\'^}k eKu,,weW' via feedback from the 
APs, but it does not need to have the knowledge of other CUs' 
channel gains in the network. 

Let Pi.w{k) represent the amount of power CU i trans- 
mits on channel k when it is associated with AP w; Let 
Pi, to = {Pi,w{k)}f^^f(- be the power profile of CU i when it 
is associated with AP w; let p-i,™ be the joint power profiles 
of all the CUs other than i that is associated with AP w: 
P-i^w = {Pj.w}j:j=^i,sL{j)=w By construction, for all w eW, 
if w ^ a{i), then pi ^j = 0. The power profiles of the CUs 
must also satisfy the following two constraints: 1) Total power 
constraints: J^keK Pi,w{k) < Pi, V i, where pi is the power 
limit for CU i; 2) Positivity constraints: Pi^w{k) > V i, fc. As 
such, each CU's feasible power allocation when it is associated 
with AP til can be expressed as: 
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|pi,«. : X! Pi,w(^) ^P>-' Pi,w{k) > 0, V fc e /C„|. 



Assume there is no interference cancelation performed at the 
AP, and the interference caused by other CUs are treated as 
noises by each CU. As mentioned in 12J, this assumption 
is reasonable considering the lack of coordination among 
the CUs. It further allows for the implementation of low- 
complexity single-user decoders on the AP. Given this as- 
sumption, for a fixed AP association and power allocation 
configuration, CU z's uplink transmission rate (when it is 
associated with AP w) can be expressed as follows: 



fee/c. 



\ht,w{k)\^Pi,w{k) 



Mk) + ErM,)= 
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log 1 



|fet,t«(fc)PPi.u.(fc) 

riwik) + Ii{k) 
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(2) 



where Ii{k) denotes the aggregated received transmission 
power level on channel fc except CU i, i.e.. 



Ii{k) = |/ij_„(fc)pPj,„(fc), li^w^{Ii{k)} 



(3) 



j:a{j)=w,j^i 



Clearly, if w = a(i), then {Ii{k)} j^^/^ can be viewed as the 
set of aggregated interference currently experienced by CU i; 



if w ^ a(i), {Ii{k)}i^^iQ can also be viewed as the set of 
aggregated interference for CU i if it were to switch to AP w. 

We see that ([T]i and are equivalent definitions of the 
CU i's transmission rate. We will use either definition in the 
following paragraph depending on the context. 

We model each CU i as selfish agent, and its objective is to 
find strategy {w*,p*.^^,) that maximizes its transmission rate: 



(w*,p*,„.) g argmax max Rt{pt,io,P-i,w;w). (4) 
We are now ready to define a non-cooperative game Q: 

Q = W, {x^}^e^r, {i?.}teAA} (5) 

where the CUs i G TV are the players in the game; 
each CU's strategy space can be expressed as Xi — 
Umgw -^i-™}' ^^^^ CU's utility function is its transmis- 
sion rate Ri{pi,.u,,p^i,ii,;w) as defined in We emphasize 
that each feasible strategy of a player in the game Q contains 
a discrete variable and a continuous vector, which makes the 
game Q unique to (and thus more complicated than) most 
of the games considered in the context of network resource 
allocation. We refer to the strategy space {xi} of this game 
as hybrid strategy space. 

The NE of this game is defined as the tuple 
ja*(i),p* > such that for all i e A/" the following 
set of equations are satisfied: 



*i*(*):Pi,a*(t) ) eargmax max i?,;(p,;^^, pi,; w) (6) 



or equivalently, V i G TV, we W, Pi^iu G ^i,™, 

^t(P*a*(t).P-»,a*(t);a*(0) > R^{P^AV,PU^^a;w). 

We call the equilibrium profile a* a NE association profile, and 
Pa* — {Pi a.-' (i)} iej\f ^ power allocation profile. In order 
to avoid duplicated definitions, we call the tuple (a* , p* . ) a 
joint equilibrium profile (JEP) of the game Q (instead of a 
NE). It is clear from either of the above definitions that in 
a JEP, the system is stable in the sense that no CU has the 
incentive to deviate from either its AP association or its power 
allocation. 

III. Properties of the JEP 

In this section, we introduce the notion of the potential 
function, and characterize its relationship with the JEP. We 
then prove that the JEP always exists for the game Q. The 
proof of Lemma[T]can be found in 111 91 . which is an extended 
version of this paper 

Let us consider a simpler problem in which the association 
vector a is predetermined and fixed. In this case, the CUs do 
not need to choose their AP associations, thus the problem of 
finding the JEP defined in ^ reduces to the one of finding 
the NE power allocation profile p* that satisfies: 

P*.amGarg max R,{p^,p%^^^^y, a{i)). (7) 

For a specific AP w, denote the set of CUs associated with it 
to be Mw- Mw ^ {i ■■ a(i) = w}. We use p.^ = {p,>,}jg^^ 
to denote the long vector containing the power profiles of all 
CUs associated with AP w. When a is fixed, the activity of 
the set of CUs TVu, w G W does not affect the activity of 
the set of CUs TVj, q G W,q ^ w, because AP w,q operate 
on different sets of channels: JC^ C\lCq ~ 0. Consequently, 
the original game Q introduced in (|5]i can be decomposed into 



W independent small games with each small game 

defined as: 



From the standard theory regarding to the existence of the NE, 
it is straightforward to see that there exists at least one NE 
power allocation p^(a) for each game Q^. In order to further 
characterize the NE power profile p^(a) of the game Q'^, we 
first introduce the notion of a potential function. 

Definition 1: The potential function of the game under 
a feasible power profile Tpw is defined as: 

P^(p.u,;a)= ^ log(n^{k) + ^ |/ij,^,(fc)ppi,t„(fc) 



keic,, 



The system potential function under a specific a and a feasible 
p is defined as the sum of the potential functions associated 
to all games {Qf,}wev\>: ^'(p; a) = EtuewP™(P^"J 
Clearly, for fixed a, P^j {y>w ; a) is a concave function 
and hence has a unique maximum point. Define J-^, = 
YiieM -^i-'w the joint feasible set for the CUs that are 
associated with AP w under the association profile a, and 
let J""^ = Uwew^w- Let £:^(a) denote the set of all NE 
power profiles for the game G^, then £{a) = Hwew ^^i^) 
is the set of all NE power profiles for the game Q under fixed 
association profile a. Let p^(a) G £to(a) and p*(a) 6 £{sl), 
then we have the following lemma regarding to the relationship 
between p*„(a), p*(a) and the potential functions. 

Lemma 1: For fixed a, a feasible p.(„ G Tf^j maximizes the 
potential function (pw ; a) ;/ and only if it is in the set 
fu)(a). We define the unique maximum value of the potential 
function as the Equilibrium Potential (EP) for AP w under 
association profile a.: Ptu(a) = maxp^gjra. P^(piu;a). 

For a fixed a, a feasible p E J-^ that maximizes the system 
potential function P(p; a) iff it is in the set f (a). Similarly as 
above, we refer to the unique maximum value of the system 
potential function as the System Equilibrium Potential (SEP) 
under a, and denote it by P{a): P{a) — X^toew «'(^)' 

We are now ready to discuss the existence of the JEP of the 
game Q as defined in (|6]l. We emphasize here that determining 
the existence of the JEP (which is a pure NE) for the game Q is 
by no means a trivial proposition. Due to the hybrid structure 
of the game Q, the standard results on the existence of pure 
NE of either continuous or discrete games can not be applied. 
Consequently, we have to explore the structure of the problem 
in proving the existence of JEP for the game Q. We have the 
following theorem regarding to the existence of JEP. 

Theorem 1: The game Q always admits a JEP. An as- 
sociation profile a G argmaXaP(a), along with any one 
of its corresponding NE power allocation profile p*(a) = 
1 P?*afi~i 1 ^ ^(a), constitute a JEP of the game Q. 

Proof: We prove this theorem by contradiction. Suppose a 
maximizes the system potential, but a is not a NE association 
profile. Then there must exist a CU i who prefers w ^ w. 
Define a new association profile a as: a(j) = a(j) except 
for the i*'' entry, in which a(i) — w. Let p*(a) G f(a), and 



p* (a) G f (a). The maximum rate CU i can get after switching 
to w if all other CUs do not change their actions is: 



(8) i?j(pj,ffi,p^(a);u;) 



keKi 



loe 



nffl(fc)+/*(/c) 

= A3(p»,t3,Pffi(a);a) - Pcs{p~{a);a) (9) 

where I*{k) is defined similarly as in (|3]l, and the vector pi ^j 
is defined as: p,;,^ = argmaxp.gjr^^ i?i(pj, pl(a); w). We 
can view the rate i?i(p,ijjr, pl(a); w) as CU i's estimate of 
the maximum rate it can get if it were to switch to AP w. 

Because CU i prefers w, from the definition of the JEP (|6]l 
we see that its current communication rate must be strictly 
less than its estimated maximum rate, i.e.: 

^»(P,*5;(a),plj,5;(a); w) < .R,(pj,i;, pi;(a); w) (10) 

where i?i(p* - (a), p^^ ^) is the actual transmission rate 
for CU i in the association profile a: 

i?.(p,, Ja), p_,„-(a); ^) ^ J^^ log (l + -^^Zg^y^j^) 

= Pis{p*{s{a.);a) - ^^(pl^ ,j;(a);a). (11) 

Combining (|9]l, ( fTOl i and (fTTT i we must have that: 

^5;(pSi(a);a)-P5(pl,^^(a);a) < 

-Pi5(Pj,-ffi,PSj(a);a) - Pffi(p^(a);a). (12) 

Notice that the term Pw{p*_i ~(a); a) is equivalent to the term 
Pis{p*^i i;(a);a), due to the equivalence of the following sets: 

{j ■ i hMj) = w} = {j : j ^ i,a{j) = w}. (13) 

Recall that Lemma [T] says the NE power allocation profile 
maximizes the potential function when a is fixed: p~(a) G 
max gjra P{5(p^;; a). Observe that the set of CUs associated 
with AP w under profile a is the same as the set of CUs 
associated with AP a under profile a excluding CU i, we must 
have plj ^(a) € J--. Consequently, the following is true: 

Ps(p^(a);a) > Ps(P-,,s(S); S) Ps(P-,,s(S); 5) (14) 
where (a) is from (T3[ . Similarly, we have that: 

-Pffi(Pffi(a);a) > Pffi(p,;,ffi,p^(a);a). (15) 
Combining (fl4l i. (fTSI ) and (fT2] |. we have that: 

■Pi5(pS;(a);a)+Pffi(Pffi(a);a) < 

Pfi;(pl(a); a) + P„~(p,l(a); a). (16) 

Noticing that the equilibrium potentials of all the APs other 
than w and w are the same between the profile a and a, thus 
adding them to both sides of ( fTSI l we have that: 



P^(p;(a);a) 



(17) 



which is equivalent to: P(a) < P(a). This is a contradiction 
to the assumption that P(a) is the maximum system potential. 
We conclude that a must be a NE association profile. Clearly, 
p*(a) is a NE power allocation profile. Consequently, we have 
that (a, p*(a)) is a JEP. ■ 



IV. The JASPA Algorithm and Its Convergence 

In this section, we first introduce an algorithm that assigns 
the CUs to their closest AP. This algorithm, although simple 
and inefficient, offers valuable insights upon which we build 
our first algorithm, called the Joint Access point Selection and 
Power Allocation (JASPA) algorithm, in subsection IIV-BI 

A. The Closest AP Association Algorithm 

Consider a fixed AP association profile a in which each CU 
is assigned to its closest AP. The "closeness" from a CU to 
the APs can be measured either by the physical distance, or 
by the strength of pilot signals received by the CU from the 
APs. Assuming that each CU has a single closest AP (ties are 
randomly broken), then the AP association profile is fixed and 
the computation of JEP reduces to the problem of finding the 
NE power allocation profile. Clearly, this scheme separates the 
process of spectrum decision and spectrum sharing, and the 
CUs only need to carry out the task of sharing the spectrum 
available to the designated AP with other CUs. However, 
as we probably can speculate, no matter how efficient such 
sharing scheme is, the overall system performance might suffer 
because of the fixed and inefficient AP assignment. We will see 
such performance degradation later in the simulation section. 

Nevertheless, we introduce two propositions stating two iter- 
ative algorithms that enable the CUs to distributedly compute 
the NE power allocation profile under the fixed a. We refer 
the readers to |fT9l for the proofs. 

Proposition 1: For a fixed association profile el, if in each 
iteration t, the CUs in the set Mw iteratively do the following. 
1 ) Calculate the best reply power allocation: 



nn,{k) + ll{k) 



l/l«,»(fc)|2 



(18) 



where cri ensures J^keK. ^ii^ii^)) ~ Pi' ^'^^ ^ii^ w) 
2) Adjust their power profiles according to: 



P'i^ = (1 - at)pl,n, + Qt*i(llu,) 
where the sequence {at}'^i satisfy at G (0, 1) and: 

T T 

lim at = 0, lim at = co, lim a? < oo. 



(19) 



(20) 



Then the CUs' individual power profiles converge to a NE 
power allocation profile, i.e., limt_j.oo Pi w = Pi V i G Afw, 
and {Pi w}ie^f^ G ftu(a). We name the above algorithm the 
Averaged Iterative-Water Filling (A-TWF). 

Proposition 2: If in each iterative t, the CUs in the set Mw 
adjust their power profiles sequentially according to: 



* d* ) 



(21) 



then their individual power profiles also converge to a NE 
power allocation profile. We call the above algorithm the 
Sequential Iterative-Water Filling (S-IWF). 

^By "sequential" we mean that the CUs in the set Afw take turns in changing 
their power allocation, and only a single CU gets to act at time t. All other 
CUs j ^ i,j G Nw keep their power allocation as in time t — I. 



From Proposition [T] and |2] we conclude that for a specific 
association profile a, all CUs i £ M are able to distributedly 
decide on their NE power allocation profiles by running 
either the A-IWF or the S-IWF algorithm. Several comments 
regarding these algorithms are in order 

1) In order to calculate ' sach iteration in- 
dividual CU only needs to know the aggregated interference 
plus noise (IPN) contributed by all other CUs on the channels 
of selected AP w, {nw{k) + Ii{k)}i.^f^ , and this information 
can be fed back to the CUs i G Afw by the AP w. 

2) Consider a single AP w. We have shown in |fT9l that when 
the number of channels becomes large, or equivalently the 
portion of the spectrum belongs to this AP is very finely 
divided, then the NE power allocation profile maximizes the 
sum capacity of the AP. In another word, the NE is efficient. 
Similar observation has been made in |2], where the authors 
proved that the NE of a fading multiple-access water-filling 
game achieves capacity. This somewhat surprising result, that 
selfish CUs by distributedly allocate their power can achieve 
system capacity, provides justification for the distributed spec- 
trum sharing scheme analyzed in this work. 

B. The Joint AP Selection and Power Allocation Algorithm 

We name the proposed algorithm Joint Access Point Se- 
lection and Power Allocation (JASPA) algorithm. Intuitively, 
the JASPA algorithm works as follows. For a fixed AP 
association profile, all CUs calculate iteratively their NE power 
allocations. After convergence, they individually try to see 
if they can strictly increase their communication rates by 
switching to another AP, assuming that all other CUs keep 
their current AP associations and power profiles. When CU 
i decides that its next best AP association should be w*, we 
record his decision by a x 1 best reply vector hi : = e„,* , 
where ej denotes a W x 1 elementary vector with all entries 

except for the 7*'' entry, which takes the value 1. In the next 
iteration, CU i's AP association decision is made according 
to a Vl^ X 1 probability vector /3*, which is properly updated 
in each iteration according to b;. We also suppose that each 
CU has a length M memory, operating in a first in first out 
fashion, that records its last M best reply vectors. 

The proposed algorithm is detailed as follows. 

1) Initialization: Let t=0, CUs randomly choose their APs. 

2) Calculation of the NE Power Allocation Profile: Based 
on the current association a*, all the CUs calculate their 
NE power allocations p*(a*), either by A-IWF or S-IWF 
algorithm. 

3) Selection of the Best Reply Association: Each CU 

1 talks to all the APs in the network, obtains necessary 
information in order to find a set of APs such that all 
w G Wf satisfies w ^ a*(i) and: 

max i?j(pi,u,,pj;,(a*);ui) > i?j(Pi(a*)iP-i(a*);a*(«)). (22) 

If W| 7^ 0, obtain the w* G W* that can offer the maximum 
rate (ties are randomly broken); otherwise, let w* ~ a*(i). Set 
the best choice vector b*+^ = e,,,*. 



4) Update Probability Vector: For each CU i, update the 
W X 1 probabiHty vector /3- according to: 



/3r 



/3^ + iT(br 



if M < t 
if M >t>0 
if t = 0. 



(23) 



Shift b*''"^ into the end of the memory; shift b* out from 
the front of the memory if t > A/. 

5) Determine the Next AP Association: Each CU i samples 
the AP index for association at iteration t+l based on /J*"*"^: 



t+i 



(24) 



where multi{.) represents a muhinomial distribution. 
6) Continue: Let t=t+l, and go to Step 2). 
We make the following remarks about the above algorithm. 

Remark 1: It is crucial that each CU finally decides on 
choosing a single AP. Failing to do so will result in system 
instability, in which the CUs switch AP association indef- 
initely, and much of the system resource will be wasted 
for closing old connections and establishing new connections 
between the APs and CUs. Specifically, it is desirable to have 
limt_>oo /3i = V z e Af, where /3* is an elementary vector 
with a single entry 1, and all other entries 0. 

Remark 2: The best reply vectors b*^^ are decided 
in each iteration based on the other CUs' AP associ- 
ations and power profiles in the previous iteration. It 
can be straightforwardly shown that in order to calculate 
maxp.^gjr.^ i?j(pj^, p*,(a*); w) for different w e W, in- 
dividual CU i does not need to know the strategies of all 
other CUs in the network, nor does it need to know the 
system association profile a*. Instead, it only requires the 
aggregated IPN on each channel from each AP of the last 
iteration. This is precisely the necessary information needed 
for finding the set W* in Step 3) of the JASPA. This property 
of the algorithm contributes to the reduction of the amount of 
messages exchanged between APs and each CU when making 
association decisions. 

Remark 3: Considering the overhead regarding to end an 
old connection and re-establish a new connection, it is reason- 
able to assume that a selfish CU is unwilling to abandon its 
current AP if the new one cannot offer significant improvement 
of the data rate. We can model such unwillingness of the CUs 
by introducing a connection cost Ci > 0, which is a private 
parameter for each CU i. A CU i will only seek to switch to a 
new AP if the new one can offer rate improvement of at least 
Ci, i.e., it will only switch to those APs w G W* that satisfies: 

max i?i(pj.u,,p^ (a* );-!«) > i?j(p*(a*), plj(a*); a*(i)) + q 

From a system point of view, such unwillingness to switch 
by the CUs might contribute to improved convergence speed 
of the algorithm, but might also result in reduced system 
throughput. These two phenomenons are indeed observed in 
our simulations, please see section [VT] for examples. 



C. Global Convergence of the JASPA algorithm 

In this subsection, we prove that our algorithm converges to 
a JEP globally, i.e., the algorithm converges regardless of the 
initial starting points or the realizations of the channel gains. 
Due to space limit, we refer the readers to llT9l for the proofs 

of the Proposition [3] 

We first introduce some notations. Define a set A as follows: 



a. & A -i^^ a appears infinitely often in {a'}. 



(25) 



We state a proposition characterizing the set A. 

Proposition 3: Let M > N. Then at least one element 
in the set A say a*, is a NE association profile. Moreover, 
(a*,p*(a*)) is a JEP (satisfy equation ©j. 

Using the results in Proposition |3] we obtain the following 
convergence results. 

Theorem 2: Let M > N. Then the JASPA algorithm pro- 
duces a sequence {(a*, p*(a*))}j^j^ that converges to a JEP 
(a*,p*(a*)) with probability L 

Proof: We first show that the sequence {a*}^^^ converges 
to an equilibrium profile a* . Notice that if at time T, = a* , 
and in the next M iterations, we always have a-^+* = a*, then 
the algorithm converges. 

Let A* & A contains all the NE association profiles 
in A. Let {a''^*'' : > 1} be the infinite subsequence 
satisfying a*'*"'' G A*. Without loss of generality, assume 
t{k) — t{k — \) > M. Let us denote by Ck the event 
in which the process converges to a a* e A*, after a 
sequence of best replies equals to a* of length M occurs, 
starting at time t{k): Ck = nf£i{a*^''^+' = a*}. Note, 
Pr{Ck+i\C^) > ijj)^^^^, because whenever a* appears, 
each CU i's best reply should be a*(z), hence a*(z) will 
be inserted into the last slot of CU I's memory. Then with 
probability {jj)^, all CUs sample the last memory and a* 
will appear in the next iteration. Thus, 



< lim (1 

T-foo 



< 1 \nx M 



fc=l 

T-1 



(26) 



This says Pr(a* converges to a a* G A* eventually) = 1. 
Finally, because p*(a*) G ^^(a*) is a NE power allocation 
profile, we conclude that (a*,p*(a*)) is a JEP. ■ 
Now that we have shown the convergence of the JASAP 
to the JEP, it is of interest to evaluate the "quality" of such 
equilibrium. In this work, we use the system throughput to 
measure the quality of the JEP, and our simulation results (to 
be shown in section IVIl i are very encouraging. 

V. Extensions to the JASPA Algorithm 

The JASPA algorithm presented in the previous section is 
"distributed" in the sense that the computation that each CU 
needs to carry out in each iteration only requires some lo- 
cal/summary information, i.e., the aggregated IPN at different 
APs in different channels, and the CU's own channel gain. 
However, this algorithm requires that for each AP associ- 
ation profile a*, an intermediate equilibrium p*(a*) should 
be reached (in Step 2), and the CUs cannot choose their 



next AP association profile until the system reaches such 
equilibrium. This requirement poses a relatively strong level 
of coordination among the CUs, which is not very desirable 
for a distributed algorithm. 

In this section, we propose the following two algorithms 
that do not require the CUs reach any intermediate equilibria; 
1) a sequential version of the JASPA algorithm (Se-JASPA) 
in which CUs act one by one in each iteration, and 2) a 
simultaneous/parallel version of the JASPA algorithm (Si- 
JASPA) in which CUs act at the same time. 

The Se-JASPA algorithm is detailed in Table |T] 



1) Initialization (t=0): Each CU randomly chooses a'\i) and p'^j^o(i) 

2) Determine the Next AP Association: 

If it is CU i's turn to act, (e.g., {(t + l)modeAf} + 1 = i), then CU i 
finds a set W* s.t.: 

It selects an AP by randomly picking w* £ W* and setting a'+^(j) = w 
For other CUs j ^ i, a*+-'-(j) = a*(j) 

3) Update the Power Allocation: 

Denote w* = a*+^(«). Then CU i calculates p*^^ as 

^» ~\ argmaXp.^.gjF.^, i?i(pi_„.,pi._^. ;«)*), othei-wise 
For other CUs j i, p^+^ = pj 

4) Continue: Let t=t+i, and go to Step 2) 

TABLE 1 
The Se-JASPA Algorithm 

We partially characterize the convergence behavior of Se- 
JASPA algorithm in the following theorem, the proof of which 
can be found in llT9ll . 

Theorem 3: The sequence {P(p*, a*)}^]^ produced by the 
Se-JASPA algorithm is non-decreasing and converging. 

We see that the Se-JASPA algorithm differs from the JASPA 
algorithm in several important ways. Firstly, a CU i does not 
need to keep its best reply vector b* as it does in JASPA. It 
decides on its AP association greedily in step 2). Secondly, 
a CU i, after deciding a new AP a*+^(i) = w* , does not 
need to go through the process of reaching an intermediate 
equilibrium with all other CUs to obtain p*(a*'''^). However, 
the CUs still need to be coordinated for the exact sequence 
of their update, because in each iteration only a single CU is 
allowed to act. Such order of update can be agreed upon and 
enforced by the APs in the network. As might be inferred by 
the sequential nature of this algorithm, when the number of 
CUs is large, the convergence time becomes long. 

The Si- JASPA algorithm, as detailed in Table HI] overcomes 
the above difficulties arise in Se-JASPA. We note that in the 
algorithm, the variable Ti represents the duration that CU i has 
stayed in the current AP, and the stepsize at satisfies ( l20l i. 

The structure of the Si-JASPA is almost the same as the 
JASPA except that each CU, after switching to a new AP, 
does not need to go through the process of joint computation 
of the intermediate equilibrium solution with all other CUs 
currently associated with the same AP: they can choose 
their AP "continuously". The level of coordination among 
the CUs required for this algorithm is minimum among all 
the three algorithms. The simultaneous update required by 
this algorithm can be realized by either one of the following 



1) Initialization (t=0): Each CU i randomly chooses a'^i) and p'^^O(j) 

2) Selection of the Best Reply Association: 

Each CU obtains the AP w* and set b'+^ following Step 3) of JASPA 

3) Update Probability Vector: 

Each CU i updates the probability vector /3' according to (23) 
Shift b*+^ into the memory; shift b*^'*^ out of memory it t> M 

4) Determine the Next AP Association: 

Each CU i samples the AP index for association as in \2A\ 

5) Compute the Best Reply Power Allocation: 

Let w^f^^ = a'+^(j). Each CUs i calculates p* as 

6) Update the Duration of Stay: 

Each CU i maintains and updates a variable Ti : 
^ _ / 1 ifa*+i(i)7^a*(i) 
^ ~ \ Ti + 1 if a*+i(i) = a*(i) 

7) Update the Power Allocation: 
Each CU i calculates p'^^ as follows: 

„t+i_/ P* ' ifa'+i(i)^a*(i) 

P« "1 (l-aTjp*+c.T.P* if a*+i(i) =a'(i) 

8) Continue: Let t=t+l, and go to Step 2) 

TABLE II 
The Si-JASPA Algorithm 

approaches: 

1) The APs agree upon the update interval off-line. Each CU 
is equipped with a timer. The first time a CU comes into the 
system, it is informed by its initial associated AP the update 
interval and the next update instance. Then the CU can perform 
update on its own. 

2) The APs agree upon the update interval off-line, and they 
alert the CUs associated with them the update instances by 
broadcasting. 

Extensive simulations confirm that generally this algorithm 
converges faster than the Se-JASPA. 

VI. Simulation Results 

In this section, we present simulation results to validate the 
proposed algorithms. For each experiment we show the results 
obtained by running either Si-JASPA and Se-JASPA, or by the 
original JASPA. 

We have the following general settings for the simulation. 
We place multiple CUs and APs randomly in a 10m x 10m 
area; we let di^w denote the distance between CU i and 
AP w, then the channel gains between CU i and AP w, 
{|/ii,tu(fc)|^}fceK„> are independently drawn from an exponen- 
tial distribution with mean . We let the available channels 
to be evenly pre-assigned to different APs. We let the length of 
the individual memory to be M = 10. For ease of presentation 
and comparison, when we use the JASPA algorithm with 
connection cost, we set all the CUs' connection cost {ci]i^js/ 
to be identical. In the following, when we say a "snapshot" of 
the network, we refer to the network with fixed (but randomly 
generated as above) AP, CU locations and channel gains. 

We first show the results regarding to the convergence of 
the algorithm. We only show the results for Si-JASPA and Se- 
JASPA in this experiment. We first consider a network with 
20 CUs, 64 channels, and 4 APs. Fig. [T] shows the evolution 
of the system throughput as well as the values of the system 
potential function generated by a typical run of the Se-JASPA, 
Si-JASPA and Si-JASPA with connection cost q = 3 bit/sec. 



We observe that the two Si-JASPA based algorithms converge 
very fast, while the Se-JASPA converges very slowly. After 
convergence, the system throughput achieved by Si-JASPA 
with connection cost is smaller than that of the other two 
algorithms. Notice that in the bottom part of Fig. [T] the system 
potential generated by the Se-JASPA is non-decreasing with 
respect to the iterations. This phenomenon has been predicted 
in Theorem [3] 




-Si-JASPA 
■-Se-JASPA 
- Si-JASPAco: 



-Si-JASPA 
-Sa-JASPA 
-Si-JASPA CO! 



Fig. 1. Convergence of different algoritlims. Top: evolution of system sum 
rate. Bottom: evolution of the value of system potential function. 





Fig. 2. Convergence of Selected CUs' AP selection. 

Fig. [2] shows the evolution of the AP selections made by 
the CUs in the network during a typical run of the Si-JASPA 
algorithm. We only show 3 out of 20 CUs (we call the selected 
CUs CU 1, 2, 3 for easy reference) in order not to make 
the figure overly crowded. Fig. |3] shows the corresponding 
evolution of the probability vectors {/3*}t£° for the three of 
the CUs selected in Fig.|2l It is clear that upon convergence, all 
the probability vector converges to an elementary vector We 
then evaluate how the number of CUs in the network affects 
the speed of convergence of different algorithms. In order to do 
so, we compare the average iterations to achieve convergence 
in the network with 4 APs, 64 channels and different number 
of CUs, for the following three algorithms 1) Si-JASPA, 2) 
Se-JASPA, 3) Si-JASPA with connection cost d = 3 bit/sec 
for all CUs. From Fig|4] we see that when the number of 
CUs in the system becomes large, the sequential version of the 
JASPA takes significantly longer time to converge than the two 
simultaneous versions of the JASPA algorithm. We can also 
see that the connection costs adopted by individual CUs indeed 
have positive effects on the convergence speed of the system. 
Note each point in this figure represents the average of 100 
independent runs of each algorithm on randomly generated 
network snapshots. 

We subsequently evaluate the network throughput perfor- 
mance achievable by the JEP computed by the JASPA. 




Fig. 4. Comparison of averaged convergence speeds. 




Fig. 5. Comparison of averaged system throughput by different algorithms 
with the throughput upper bound in a 8 CU network. 

We first investigate a small networks with 8 CUs, 64 
channels and 1, 2, 3, 4 APs, and compare the performance of 
JASPA related algorithms to the maximum network throughput 
that can be achieve for the same network. The maximum 
network throughput for a snapshot of the network is calculated 
by the following two steps: 1) for a specific AP-CU association 
profile, say a, calculate the maximum network throughput 
(denoted by T{a.)) by summing up the maximum capacit}]^ of 
individual APs in the network; 2) enumerate all possible AP- 
CU association profiles, and find T* ~ maxar(a). We see 
the reason that we choose to focus on such relatively small 
networks in this experiment is that for a large network, the 
time it takes for the above exhaustive search procedure to find 
the maximum network throughput becomes prohibitive. The 
result is shown in Fig|5] where each point in the figure is 
obtained by averaging the results obtained by the algorithms 
on 100 independent snapshots of the network. We see that 
the JASPA algorithm performs very well with little throughput 
loss, while the closest AP algorithm, which separates the tasks 
of spectrum decision and spectrum sharing, performs poorly. 
We then start to look at the performance of larger networks 
with 30 CUs, up to 16 APs and up to 128 channels. Fig. |6] 
shows the comparison of the performance of JASPA, JASPA 
with individual cost ~ 3 bit/sec and Ci = 5 bit/sec, and 
the closest AP algorithm. We adopt the actual distance as the 
measure of "closeness" in the closest AP algorithm. Each point 
in this figure is the average of 100 independent runs of the 
algorithms. Due to the prohibitive computation time required, 
we are unable to obtain the maximum system throughput 
for these relatively large networks. We instead compute the 
equilibrium system throughput that can be achieved in a non- 

^For a single AP with fixed number of users and channel gains, the 
maximum capacity is the well-known multiple access channel sum capacity. 
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Fig. 6. Comparison of the system throughputin a 30 CU network. 

cooperative game ;/ all CUs are able to connect to multiple 
APs at the same time. We refer to this situation the K- 
connectivity network (the "K-Connectivity" network is similar 
in spirit to the "multi-homing" WLAN studied in 1(121 ). It 
is clear that in the K-connectivity case, there is no need for 
the CUs to perform the AP selection algorithm, and the CUs 
in this network enjoy the flexibility of being able to connect 
to multiple APs at the same time. However, we observe that 
the performance of JASPA is very close to that of the "K- 
Connectivity" network. 

From Fig.|6]we see that when the number of APs increases, 
the throughput of the JASPA algorithm becomes much better 
than the closest AP algorithm, a phenomenon that is partly due 
to the fact that for the closest AP algorithm, the separation 
of the AP selection and power allocation process results in 
the insufficient use of the spectrum: when the number of AP 
increases, it becomes increasingly more probable that several 
APs are idle because no CUs are close to them. Fig. |6] along 
with Fig. m also serve to confirm our early speculation that 
there indeed exists tradeoff of convergence speed and system 
throughput between JASPA and JASPA with connection cost. 

VII. Conclusion 

In this paper, we addressed the joint AP association and 
power allocation problem in a CRN, and formulate it into a 
non-cooperative game with hybrid strategy space. We charac- 
terized the NE of this game, and provided distributed algo- 
rithms to reach such equilibrium. Empirical evidence gathered 
from simulation experiments suggests that the equilibrium has 
very promising quality in term of the system throughput. 

The problem analyzed in this work, particularly the game 
with mixed strategy space, can be extended to solve many 
other problems, for example, the CRN with interference chan- 
nel and segmented spectrum mentioned in section II-AI It will 
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also be our future research topic to analyze the effect of 
random arrivals and departures of the CUs on the performance 
of the algorithm, and propose suitable heuristic dealing with 
these situations. 
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